OPTIMAL BINARIZATION OF GRAY-SCALED DIGITAL 
IMAGES VIA FUZZY REASONING 

CROSS REFERENCE TO RELATED APPLICATIONS 

s [0001] This application is related to an application entitled Image Edge Extraction 
Via Fuzzy Reasoning, which is commonly owned with the subject application and is to 
be filed under Docket Number KSC-12278. 
ORIGIN OF THE INVENTION 

[0002] The invention described herein was made in the performance of work 
10 under a NASA contract and is subject to the provisions of Public Law 96-51 7 (35 U.S.C. 
§202) in which the contractor has elected not to retain title. 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0003] The present invention relates in general to a method and system for 
15 converting gray scale images to binary images which employs fuzzy reasoning to 
calculate an optimal binarization threshold value. 

2. Description of the Background Art 

[0004] Conversion of gray-scale digital images to binary images is of special 
interest because an image in binary format can be processed with very fast logical 
20 (Boolean) operators by assigning a binary value to each of the image's pixels. A binary 
one value indicates that the pixel belongs to the image foreground, which may represent 
an object in the image, while a binary zero value indicates that the pixel is darker and 
belongs to the image's background. Since most image display systems and software 
employ gray-scale images of 8 or more bits per pixel, the binarization of these images 
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usually takes 2 extreme gray tones, black and white, which are ordinarily represented 
by 0 or 255, respectively, in an 8-bit gray-scale display environment. 
[0005] Image thresholding is the simplest image segmentation approach for 
converting a gray-scale image to a binary image. It is actually a pattern classification 

5 procedure in which only one input feature is involved, this being the pixel intensity value. 
Usually a binary image is obtained from an 8-bit gray-scale image by thresholding the 
image and assigning either the low binary value (0) or the high (255) value to all gray 
levels based on the chosen threshold. Obviously, the threshold that is chosen has a 
critical importance since it controls the binary-based pattern classification that is 

10 obtained from the gray-scale image. The key issue is to choose an optimal threshold so 
that the number of misclassified image pixels is kept as low as possible. Since images 
can differ substantially from one another depending on the objects contained therein, 
the optimal threshold value can vary considerably from one image to the next. Thus, 
merely selecting a threshold value that is, for example, set at the average pixel intensity 

15 value for the gray-scale image will probably not provide the optimal threshold. If the 
threshold is selected incorrectly, substantial image information will likely be lost in the 
conversion to binary. 

[0006] Numerous techniques have been employed to address the foregoing 
issue. The most accurate of these are non-interactive techniques that do not require 
20 selection of any process parameters to identify the optimal threshold. Such techniques 
automatically select the appropriate threshold based on an analysis of each image to be 
converted. An example of such a technique is disclosed by N. Otsu in A Threshold 
Selection Method From Gray-Level Histograms. IEEE Transaction on Systems, Man, 
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and Cybernetics, 9(1):62-66, (1979) (hereinafter referred to as the Otsu method). In the 
Otsu method, the optimal threshold is determined by minimizing the two variance 
classes; total variance and in-class variance. In other words, the means/averages of 
the two classes (background and foreground) should be as well separated as possible 
5 and the variances (standard deviation) in both classes should be as small as possible. 
The Otsu method is basically based on selecting the lowest point between the two 
classes. 

[0007] One particularly promising non-interactive approach is to employ fuzzy 
reasoning to determine the optimal threshold for binarization. Fuzzy reasoning is a 

10 logical reasoning technique that attempts to mimic more accurately how the human 
brain reasons. Under the fuzzy reasoning approach, a logic problem becomes more 
than deciding whether to assign a binary one or zero to a particular bit, pixel or 
parameter. Fuzzy reasoning goes one step further and recognizes that there is 
information contained in the degree to which a given value possesses a particular 

15 characteristic. For example, there is much less certainty that a particular pixel is in the 
background or foreground of the image if the pixel is very near a selected intensity 
threshold than if the value were far below or above the threshold. In a fuzzy reasoning 
approach, a multiple pixel digital image is defined as an array of fuzzy singletons, each 
having a membership value somewhere between 0.0 and 1 .0 that denotes its degree of 

20 possessing some property (e.g., brightness, darkness, edginess, blurredness, texture 
etc.). For image binarization, the membership function is defined in terms of the degree 
a pixel having a particular gray level value in the image belongs to one of the two binary 
classes, background and foreground. 
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[0008] Once the membership function is formed, the function can be employed to 
determine the optimum threshold value that defines the boundary between background 
and foreground gray levels. This is accomplished by identifying the threshold value . 
which results in the membership function providing the minimum fuzzy entropy for the 

5 image. The concept of fuzzy entropy is generally defined in information theory as a 
measure of information. In the context of fuzzy reasoning, the entropy is a measure of 
the degree of fuzziness. Thus, in the image binarization application, the goal is to select 
a threshold value that results in the minimum fuzziness or uncertainty. 
[0009] An example of the use of fuzzy reasoning in image binarization is the 

10 method disclosed by Huang and Wang in Image Thresholding by Minimizing the 
Measures of Fuzziness, Pattern Recognition, Vol. 28, No. 1, pp 41-51 (1995) 
(hereinafter referred to as the Huang-Wang method). In the Huang-Wang method, a 
triangular membership function for the foreground and background classes is employed 
in which the graph of the function appears as two adjacent triangles that join at a 

15 selected threshold value. The peak values of the triangles occur at the average pixel 
intensity level for each class, where the membership value is 1 .0. To identify the 
optimal threshold, an iterative trial and error technique is employed to identify the 
threshold that results in the minimum fuzzy entropy for the membership function. 
Shannon's entropy function, which is a logarithmic function in the shape of a parabola, 

20 is used as an entropy factor or cost function to calculate the entropy measure for a 
selected threshold. The threshold value that results in the minimum fuzzy entropy is 
then selected as the optimal threshold for binarization of the image. 
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[0010] Although the Huang-Wang method is fairly accurate and selects image 
thresholds that in general result in preservation of more image information than more 
conventional techniques, this increased conversion accuracy comes at the expense of 
substantially more computational power and execution time. For example, In tests 

5 comparing the Huang-Wang method to the Otsu method, the Huang-Wang typically 
required approximately 3 times the execution time than that of the Otsu method. The 
extended execution time is primarily due to the logarithmic nature of Shannon's entropy 
function which complicates the necessary calculations. In addition, use of Shannon's 
function restricts the values of the membership function to a range of 0.5 to 1 .0, which 

io limits accuracy. The limited range is necessary because the parabolic shape of 

Shannon's function has increasing values between membership values of 0.0 and 0.5, 
and decreasing values between membership values of 0.5 and 1 .0. However, because 
the cost or entropy should decrease as the membership function value increases (as 
the fuzziness becomes smaller), the membership values below 0.5 cannot be employed 

15 when Shannon's function is selected as the entropy measure function. As a result of 
the foregoing, there is a need for a fuzzy reasoning based binarization technique that 
can operate effectively with reduced execution times. 
SUMMARY OF THE INVENTION 

[001 1] The present invention addresses the foregoing need through provision of 
20 an improved computational technique based on fuzzy entropy measure for finding an 
optimal binary image threshold for binarization. This new technique provides 
substantial improvements both in execution speed and accuracy over the previously 
discussed Huang-Wang method. As in the Huang-Wang method, the new method 



5 



employs a triangular membership function which is dependent on the degree to which 
the pixels in the image belong to either the foreground class or the background class. 
However, the membership function in the subject invention differs from that in the 
Huang-Wang method in two notable ways. First, the membership values vary fully from 

5 0.0 to 1 .0 which improves measurement accuracy. Second, the membership function 
employs lower and upper bound gray level limits which are selected to be equal to the 
minimum and the maximum gray levels, respectively, that are present in the image to be 
converted. This also improves accuracy, especially where the image to be converted 
does not include pixel values at either end of the gray level spectrum. For example, 

10 predominantly dark images can be more accurately converted by eliminating the lighter 
gray level values that are not present in the image from the membership function. 
[0012] The membership function can include values from 0.0 to 1 .0 because a 
simplified fuzzy entropy function is employed that decreases for all membership values 
between 0.0 and 1 .0. The entropy function is linear and is defined as 1 minus the 

15 membership value for each gray level. The use of a linear entropy factor function also 
simplifies the calculations that are necessary to determine the fuzzy entropy for each 
possible threshold value. As a result, execution times for the subject technique are 
typically on the order of 3 or more times shorter than the execution times using the 
Huang-Wang method. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] The features and advantages of the present invention will become 
apparent from the following detailed description of a preferred embodiment thereof, 
taken in conjunction with the accompanying drawings, in which: 
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[0014] FIG. 1 is a block diagram of a computer system for converting gray scale 
images to binary images using a fuzzy reasoning based intensity threshold 
determination technique in accordance with the preferred embodiment of the present 
invention; 

5 [0015] FIG. 2 is a flowchart showing the steps carried out by the threshold 
determination technique of the preferred embodiment; 

[0016] FIG. 3 is a graph illustrating a membership function employed in the 
threshold determination technique of the preferred embodiment; 
[0017] FIG. 4 is a graph illustrating an entropy factor function that is employed in 
10 the threshold determination technique of the preferred embodiment; 
[0018] FIG. 5A is a gray-scale image to be converted to binary; 
[0019] FIG. 5B is a binary representation of the image of FIG. 5A that has been 
obtained using the known Otsu method; 

[0020] FIG. 5C is a binary representation of the image of FIG. 5A that has been 

15 obtained using the known Huang-Wang method; and, 

[0021] FIG. 5D is a binary representation of the image of FIG. 5A that has been 
obtained using the binarization technique of the preferred embodiment. 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
[0022] With reference to FIG. 1 , a computer system 10 is illustrated which is 

20 configured to convert multiple bit gray-scale digital images into binary images using an 
image thresholding determination algorithm in accordance with a preferred embodiment 
of the present invention. The system 10 includes a processor 12 which is interfaced to 
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an operating memory 14 and a storage memory 16, as is conventional. Loaded into the 
operating memory 14 is a binarization program or software module 18. 
[0023] Gray-scale images to be converted to binary are either retrieved from the 
storage memory 16 or from an external image source 20 and are fed into the 

5 binarization program 18, which performs the conversion. To accomplish this, the 

binarization program includes a threshold determination algorithm or subroutine 22 that 
identifies the optimal threshold to be employed for converting the gray-scale image. 
The threshold T is selected by the algorithm 22 on an image by image basis and defines 
the gray level above which any pixels will be assigned a binary one as belonging to a 

10 first intensity related characteristic class (e.g. the foreground) of the image and at or 
below which, any pixels will be assigned a binary zero as belonging to a second 
intensity related characteristic class (e.g. the background) of the image. It should be 
noted that in the case of foreground and background classes, these two can be 
interchanged depending on whether a black or a white background is employed in the 

15 image. 

[0024] The threshold determination algorithm implements a computational 
technique that is based on fuzzy entropy measure and is designed to find an optimal 
binary image threshold without external parameter input. Under the fuzzy reasoning 
approach, a digital image to be converted is defined as an array of fuzzy singletons, 
20 each having a membership value denoting its degree of either being in the foreground 
or the background of the image. Under this assumption, an image I can be represented 
as: 

[0025] I-[f(x,y),|i I (f(x,y))] (1) 
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[0026] In the preferred embodiment, the membership function Mi(f(*.y)) is defined 
in terms of the degree that a pixel (x,y) in the image belongs to one of the two binary 
classes, background and foreground. The respective membership function in each of 
these two classes is built based on the average gray level of the pixels in each class, 

5 which is computed using the gray-level histogram as an average weight factor. Thus, 
the first step 100 of the process, as illustrated in the flowchart of FIG. 2, is to calculate 
the histogram H for the image to be converted by identifying the number of pixels in the 
image having each of the L possible gray levels z. Next, at step 102, the lowest gray 
level, MinZ, and the highest gray level, MaxZ, in the image are noted and used with the 

10 histogram information and a first selected intensity threshold T to calculate the average 
gray level for each of the two classes, background and foreground, using equations 2 
and 3, respectively: 

[0027] G x (T) = £ [zH{z)\ / £ H(z) (2) 

MinZ MinZ 
MaxZ MaxZ 

[0028] G 2 (7-)=£[z//(z)]/2>(z) (3) 

r+i r+i 

15 [0029] In these equations the Domain is defined as: 0 < MaxZ, MinZ, T, and z £ 
L-1; where T = Binarization threshold value; z =gray level; MinZ = lowest gray level 
holding a nonzero histogram value; MaxZ = highest gray level holding a nonzero 
histogram value; L = total gray-level values (e.g. for an 8-bit image, L = 2 8 = 256); H(z) = 
Image histogram value of gray level z; Gi(T) = average gray-level value for class 1 

20 (background); and, G2(T) = average gray-level value for class 2 (foreground). 
[0030] Once the foregoing values are determined, the next step 1 04 of the 
process is to form the membership function. The membership function is a linear- 



l\ V I* 



[0031] ^(z) = 



(4-7) 



triangular-type at each one of the two classes (background and foreground) and is 
defined by the equations 4-7. A graph of the resulting function is illustrated in FIG. 3. 

f [z -MinZ\[G x (T)-MinZ\ if MinZ <z<G x (t)} 

[T-z\[T-GXT)] if G x {T)<z<T 

[z-T\[G 2 {t)-T] if T<z<G 2 {T) 

^MaxZ - z]/[MaxZ - G 2 (t)] if G 2 (T) < z < MaxZ ) 

[0032] The membership function is thus made up of two triangular sections that 
5 are separated from each other by the initially selected threshold value T. The section at 
or below the threshold T represents pixels that belong to the background (binary 0), 
while the section above the threshold represents the pixel values that belong to the 
foreground class (binary 1 ). 

[0033] The technique of the preferred embodiment is an iterative one in which no 
10 parameters are required to be entered. In the preferred embodiment, an initial threshold 
of T= MinZ + 4 is arbitrarily selected as a starting point and this value is incremented by 
1 until all possible thresholds up to MaxZ - 2 have been tried to determine which one 
results in the minimum fuzzy entropy. It should be noted that the range and number of 
thresholds tested can be selected to be any number desired, although in general, the 
15 more possible thresholds that are tested, the more accurate the results. 

[0034] A modified fuzzy entropy measure of an image is used as the cost function 
for the selection of the optimal image threshold needed to determine the image pixels 
that belong to either the background or the foreground of the image. The concept of 
fuzzy entropy is generally defined in information theory as a measure of information. 
20 The image entropy measures the amount of information an image contains using the 
histogram information and its respective entropy factor that it is built as a function of the 
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triangular-type membership function. The entropy measure function is defined in 
equation 8: 

MaxZ 

[0035] S(T) = {l/[MNlog2]} ^H^e^ (z)] (8) 

MinZ 

[0036] Where, S(T) = fuzzy entropy measure; M = Image rows (number of 
5 horizontal pixels); N = image columns (number of vertical pixels); and, Se[ui(z)] = fuzzy 
entropy factor function. 

[0037] A simple negative slope linear function defined as Se[ui(z)] = 1 -ui(z) is 
selected to calculate the entropy factor since the entropy measure should decrease as 
the membership value increases (as the fuzziness becomes smaller). Se[ui(z)] is 

10 illustrated in FIG. 4. 

[0038] Thus, once the membership function is determined, the next step 106 of 
the process is to calculate the entropy factor function Se[ui(z)] from the membership 
function. Next, the fuzzy entropy measure S(T) is calculated at step 108 using equation 
8. The program next determines whether all thresholds have been evaluated at step 

15 110. If not, a new threshold is selected by incrementing T at step 112 and the process 
of steps 102-108 is repeated. 

[0039] Once all thresholds have been evaluated, the final step 1 14 of the process 
is to select the optimal threshold value T OPTIMAL which is the one of the selected 
thresholds that results in a minimum fuzzy entropy measure, that is: 
20 [0040] T OPTIMAL = arg min S(t) where MinZ < T < MaxZ (9) 

[0041] The fuzzy entropy measure S(T) has the following properties. S(T) is 
large if many pixels have membership close to 0.0 or their gray levels are far from their 
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class average gray levels. It has a maximum value of 1 if all membership values are 
equal to 0.0. S(T) is small if many pixels have membership values close to 1 or their 
gray levels are close to their class average gray levels. It has a minimum value 0 if all 
membership values are equal to 1 . S(T ') < S(T 2 ) if image 1 1 with S(T ') is crisper 

5 (less fuzzy) than image 1 2 with S(T 2 ). In this case, 1 1 has pixel gray levels distributed 
more compactly around the two class average gray levels than 1 2 
[0042] In each of the two binary image classes, background and foreground, the 
membership value equal to 1 is the largest at the class average gray level and reduces 
its value as low as 0 when the difference between the pixel gray level and its class 

10 average level increases. This means that pixels with gray levels close to their 

corresponding class average gray levels have less fuzziness or ambiguity and thus can 
be classified with greater confidence than pixels with gray levels far from their class 
gray levels. The image entropy measure is used as a cost function to find the optimal 
threshold (equation 9). It is defined using the histogram information as shown in 

15 equation 8. The entropy factor needed to compute the entropy measure is calculated 
using the simple and fast computational linear function of FIG. 4. 
[0043] As discussed previously, the proposed method uses a similar but more 
efficient and faster computational approach than the one used in Huang-Wang method. 
The Huang-Wang method uses a symmetric membership function that includes all 

20 possible gray level values, while the proposed approach uses a more realistic 
membership function having the highest and lowest gray levels holding nonzero 
histogram values of the image to be converted as the domain limits. The subject 
method also does not restrict the range of membership values and uses a straight-line 
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cost function that requires much less computational power than the Shannon function 
used by Huang-Wang method. 

[0044] To demonstrate the effectiveness of the subject binarization technique, 
tests were conducted to compare the subject technique to the prior Otsu and Huang- 
Wang methods. The results of these tests are illustrated in FIGs. 5A-5D. FIG. 5A 
shows the gray scale image to be converted to binary, while FIGs. 5B-5D show the 
resulting binary images using Otsu, Huang-Wang, and the subject methods, 
respectively. As can readily be observed, the subject method does a much better job of 
filtering out extraneous background material from the image than either of the two prior 
techniques. More telling is the execution time. The Otsu method represented by FIG. 
5B required a respectable 1 .5 milliseconds to convert the image, while the Huang-Wang 
method took a much longer 10.8 milliseconds. The subject method took 2.0 
milliseconds, only slightly longer than the non-fuzzy reasoning based Otsu method and 
less than 175 th the time of the Huang-Wang method. Thus, for images with textured 
background and poor printing quality, the subject method has a consistently better 
overall binarization performance than Huang-Wang and Otsu methods. 
[0045] Although the invention has been disclosed in terms of a preferred 
embodiment, it will be understood that modifications and variations could be made 
thereto without departing from the scope of the invention as set forth in the following 
claims. 
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